A Novel Template Matching Approach to Speaker-Independent Arabic Spoken Digit Recognition

نویسندگان

  • Jiping Sun
  • Jeremy Sun
  • Kacem Abida
  • Fakhri Karray
چکیده

In this paper we propose a quantized time series algorithm for spoken word recognition. In particular, we apply the algorithm to the task of spoken Arabic digit recognition. The quantized time series algorithm falls into the category of template matching approach, but with two important extensions. The first is that instead of selecting some typical templates from a set of training data, all the data is processed through vector quantization. The second extension consists of a built-in temporal structure within the quantized time series to facilitate the direct matching, instead of relying on time warping techniques. Experimental results have shown that the proposed approach outperforms the time warping pattern matching schemes in terms of accuracy and processing time.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Microprocessor based Speech Recognizer for Isolated Hindi Digits

A novel method for recognition of isolated spoken words on an 8-bit microprocessor is presented. The method uses a new but simple feature vector based on the zero-crossings of the speech signal. The feature vector is the histogram of the time-interval between successive zero-crossings of the speech signal. Dynamic time warping is used to calculate a time-aligned normalized distance between the ...

متن کامل

Evaluation of Similarity Measures for Template Matching

Image matching is a critical process in various photogrammetry, computer vision and remote sensing applications such as image registration, 3D model reconstruction, change detection, image fusion, pattern recognition, autonomous navigation, and digital elevation model (DEM) generation and orientation. The primary goal of the image matching process is to establish the correspondence between two ...

متن کامل

Video Augmentation for Improving Audio Speech Recognition under Noise

For the recognition of speech, in particular spoken digits, captured in video with poor sound due to noise, we develop a novel audio-visual fusion technique that performs significantly better than utilising either audio or video signal alone. Specifically, we present an audio-visual intermediate fusion strategy to locate speaker dependant pronounced digits in continuous video recorded with soun...

متن کامل

Text-independent speaker recognition using graph matching

Technical mismatches between the training and matching conditions adversely affect the performance of a speaker recognition system. In this paper, we present a matching scheme which is invariant to feature rotation, translation and uniform scaling. The proposed approach uses a neighborhood graph to represent the global shape of the feature distribution. The reference and test graphs are aligned...

متن کامل

Speaker Independent Voice Recognition Calculator

The voice activated calculator is a speaker-independent system that is used to perform basic mathematical operations. It recognizes the isolated spoken digits from 0 to 9, and other words like plus, minus, times, equal and clear. It then performs the respective arithmetic operations, and displays the final answer on an LCD display.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012